Reducing Cache Misses in Numerical Applications Using Data Relocation and Prefetching
نویسندگان
چکیده
Numerical applications frequently contain nested loops that process large arrays of data. The execution of these loop structures often produces memory reference patterns that utilize data caches poorly. Indeed, poor reuse of the data, large working set sizes, and frequent non-unit stride accesses all combine to cause many cache misses. To improve cache performance, data copying has been proposed. However, this technique has high overhead. In this paper, instead, we propose a combined hardware and software technique called data relocation and prefetching which eliminates much of the overhead of data copying through the use of special hardware. Furthermore, by relocating the data while performing software prefetching, the overhead of copying the data can be reduced further. This technique performs better than prefetching alone because it reduces cache misses through relocation, and it reduces overhead by prefetching multiple elements at once. The hardware is designed to overlap relocation and prefetching with normal execution, and to highly utilize the available bus bandwidth. Simulation results show that this technique greatly reduces data cache miss rates. As a result, large applications including PERFECT and SPEC benchmarks achieve up to 2.5 times speedup. The hardware support required by this technique has been greatly re ned over that presented in an earlier paper. Index terms Cache con icts, data copying, data relocation, program optimization, software prefetching.
منابع مشابه
Miss Penalty Reduction Using Bundled Capacity Prefetching in Multiprocessors
While prefetch has proven itself useful for reducing cache misses in multiprocessors, traffic is often increased due to extra unused prefetch data. Prefetching in multiprocessors can also increase the cache miss rate due to the false sharing caused by the larger pieces of data retrieved. The capacity prefetching strategy proposed in this paper is built on the assumption that prefetching is most...
متن کاملECE1718 Project Final Report Improving Data Locality During Thread-Level Speculation
Locality conflict is a major problem during thread-level speculation (TLS). This paper addresses three potential techniques for reducing data cache misses, namely universal prefetching, ORB prefetching and prefetching on speculative violation. Universal prefetching works by prefetching clean cache lines from the unified cache to all data caches when one of the data caches suffer a speculative r...
متن کاملImproving the Data Cache Performance of Multiprocessor Operating Systems
Bus-based shared-memory multiprocessors with coherent caches have recently become very popular. To achieve high performance, these systems rely on increasingly sophisticated cache hierarchies. However, while these machines often run loads with substantial operating system activity, performance measurements have consistently indicated that the operating system uses the data cache hierarchy poorl...
متن کاملRedesigning Database Systems in Light of CPU Cache Prefetching
Computer systems have enjoyed an exponential growth in processor speed for the past 20 years, while main memory speed has improved only moderately. Today a cache miss to main memory takes hundreds of processor cycles. Recent studies have demonstrated that on commercial databases, about 50% or more of execution time in memory is often wasted due to cache misses. In light of this problem, a numbe...
متن کاملComparing Data Forwarding and Prefetchingfor Communication - Induced Misses in Shared - Memory MPs 1
As the diierence in speed between processor and memory system continues to increase, it is becoming crucial to develop and reene techniques that enhance the eeectiveness of cache hierarchies. Two such techniques are data prefetching and data forwarding. With prefetching, a processor hides the la-tency of cache misses by requesting the data before it actually needs it. With forwarding, a produce...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995